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Abstract — We consider hard-decision iterative decoders for 
product codes over the erasure channel, which employ repeated 
rounds of decoding rows and columns alternatingly. We derive the 
exact asymptotic probability of decoding failure as a function of 
the error-correction capabilities of the row and column codes, the 
number of decoding rounds, and the channel erasure probability. 
We examine both the case of codes capable of correcting a 
constant amount of errors, and the case of codes capable of 
correcting a constant fraction of their length. 

I. Introduction 

One of the simplest methods of combining two codes is 
the product construction. Let C\ and C2 be [ni, k\, di] and 
[712,^2,^2] linear codes respectively. Then, the set of n\ x 
712 arrays whose columns are codewords of C\ and whose 
rows are codewords of C2, is the product code C\ ® C2 with 
parameters \n\U2, k\k,2, ditfc]. 

Product codes are useful in a variety of applications (for 
references see [1]). They may be found in the ubiqui- 
tous CD standard IEC-908 and CD-ROM standard ECMA- 
130 (for details see [2]), as well as the DVD standard 
(www . dvdf orum . org). Their rectangular shape makes them 
especially appealing to two-dimensional error-control applica- 
tions (for references see [3]). 

The fact that product codes retain entire codewords of their 
constituent codes makes it tempting to use an iterative decoder 
in the following fashion. First, each of the columns is decoded 
using a decoder for C\. The resulting (partially) decoded 
array is then used for a new round of decoding in which 
each row is decoded using a decoder for C2. This process 
may be carried for any number of rounds deemed necessary, 
alternatingly decoding rows and columns. It is therefore a 
natural question to ask what is the decoding-failure probability 
of such a scheme. 

This probability is obviously a function of the amount of 
errors correctable by the columns and row codes, the number 
of decoding rounds, and the channel error probability. The 
channel need not be binary, and we assume that it introduces 
an error in a certain position independently of other positions, 
and with the same distribution. We use the row and column 
decoders as black boxes with the only assumption being that 
they do not misdecode, i.e., they either correct all the errors 



or do nothing. This is the case when we take the g-ary erasure 
channel. 

In this work we calculate the asymptotic decoding-failure 
probability. This is a rare case where we can precisely predict 
the performance of iterative decoding analytically for a specific 
code rather than a random ensemble. We manage this by 
recasting the problem to a graph-theoretic setting in which the 
channel is thought of as producing random bi-partite graphs. 
By doing so, we can rely on well-known mechanisms for 
checking properties of random graphs. 

The paper is organized as follows. In Section [H] we give 
some necessary background on random graphs. We continue 
to Section |ni| in which we examine row and column codes 
which can correct a constant number of errors. This will be 
referred to as the constant error correction case. In Section 
IIVI we handle the case of row and column codes capable of 
correcting a constant fraction of their length, which we will 
call the linear error correction case. We conclude in Section 
[V]with a discussion of the results. 

II. Background 

Let us examine the case where we have a product code with 
codewords of size n x 77, and the column code and row codes 
are each capable of correcting t errors. After passing through 
the channel, a received word may contain errors. We can 
represent these errors as a bi-partite graph G — (Vl, Vr, E) 
with a vertex in Vl — {1, . . . , n} for each row, and a vertex 
in Vr = {n + 1, . . . , 2n} for each column. An edge is 
in E if and only if position j — n) is in error. 

Let 77, £ and ur be positive integers, and let < p < 1. The 
random bi-partite graph, G[ul, nR,p), is a probability space 
over the set of bi-partite graphs on the vertex set V = VlUVr, 
where \Vl\ = til, \Vr\ — ur, Vl D Vr = 0, and with 



Pv[(i,j)eG] = 



i G V L ,j £ Vr 
otherwise, 



with these mutually independent. Hence, we can think of our 
channel as producing a random bi-partite graph G(n, n,p). 

A round of decoding consists of an attempt to decode either 
all the rows, or all the columns. When viewed in the bi-partite 
graph representation, a decoding round consists of going over 
either the vertices of Vl or the vertices of Vr, and for each 



vertex with degree less or equal to t, removing all of its 
adjacent edges. Successive rounds alternate between rows and 
columns. We assume, w.l.o.g., that the last round is always 
performed on the rows. We denote the number of rounds as 
r, where r is a constant. 

Given a bi-partite graph, G, representing the transmission 
errors, we say that it is (r, t)-decodable if a decoder which 
can correct up to t errors in each row and column, corrects 
all the errors after r rounds. In other words, after r rounds of 
edge removals as described above, no edges remain. Our aim 
is to analyze the asymptotic probability of decoding failure. 

Throughout this paper we follow the notation of [4]. Given 
two functions, f(n) and g(n), we say that f(n) <C g{n) if 
/(n) = o(g(n)). Let G be a graph, and let A be a graph- 
theoretic property. If G has property A, we denote it by 
G \= A. In our case, A is the property that the graph is 
(r, £)-decodable. As mentioned in [4], many graph-theoretic 
properties exhibit a threshold behavior as follows. 

Definition 1: r(n) is called a threshold function for a graph 
theoretic property A if 

• When p(n) <C r(n), Hindoo Pr[G(n, n,p(n)) \= A] = 0. 

• When p(n) ^> r(n), lim^oo Pr[G(n, n,p(n)) \= A] = 1. 
Finally, given some event A whose probability depends on 

some parameter n, we say that A occurs almost always if 
li mrwoc Pr[A] = 1. 

III. The Constant Error Correction Case 

In this section we handle the case where both the column 
and row codes are capable of correcting a constant t number 
of errors. This is done by first noting that a bi-partite graph 
representing the transmission errors is decodable if and only if 
it does not contain a certain subgraph which we call an (r, t)- 
undecodable configuration. We then continue, using the theory 
of random graphs, to analyze the probability that the random 
bi-partite graph contains this undecodable configuration. We 
need the following definitions first. 

Definition 2: Let G = (V, E) be a graph, and let v G V be 
a vertex of the graph. We denote by Ni(v) the set of vertices 
of G which are reachable from v by a path (not necessarily 
simple) of length exactly i. 

We note that under this definition, a vertex of degree at 
least 1 is its own neighbor at distance 2, since we can take a 
path going over an outgoing edge, and returning by the same 
edge. In fact, such a vertex is its own neighbor for any even 
distance. We denote the degree of vertex v G V as d(v). 

Definition 3: Let G = (Vl,Vr^E) be a bi-partite graph. 
We say that G is an (r,t)-undecodable configuration if there 
exists v G Vl such that all the following hold: 

• [£ =0 N i (v) = V L \JV H . 

. For all < i < r - 1 and v' G N t (v), d(v') >t + l. 

We call v the root of G. 

Note that by the first requirement, the graph must be 
connected. Hence, the sets Ni(v) are not disjoint, and both 

N {v) C N 2 {v) C N 4 (v) C ... 



and 

N^v) C N 3 (v) C N 5 (v) C ... 

hold. Furthermore, if v 1 G Ni(v) for some i > 0, then its 
immediate neighbors are all in Ni + \(v). 

Definition 4: Let G = (Vl,Vr, E) be a bi-partite graph, 
and let H = (V[, V R , E') be another bi-partite graph. We say 
that H is an ordered bi-partite subgraph of G if there exist 
injective functions fa : V' L — ► Vl and /r : — > Vr such 
that if (v!,v 2 ) G E', then (f l{vi) , f r{v 2 )) G E. 

The following theorem is the basis for our analysis. 

Theorem 1: A bi-partite graph G = (Vl,Vr, E) is (r, t)- 
decodable if and only if it does not have an (r, t)-undecodable 
configuration as an ordered bi-partite subgraph. 

Proof: In the first direction, let us assume that G 
contains an (r, t) -undecodable graph H as an ordered bi-partite 
subgraph. Let v be the root of H. In the first round of decoding 
G, the vertices corresponding to N r -i{v) are not decoded 
since they have degree of at least t + 1. In the following round, 
if we take any vertex v' G N r - 2 (v), all of its neighbors are in 
N r _i(v) so they were not corrected in the first round. Since v' 
has degree at least t+1, it follows that the vertices of N r - 2 (v) 
are not corrected in the second round. Continuing in the same 
manner, after r rounds, the sole vertex of Nq(v) = {v} is not 
corrected, so G is not (r, t) -decodable. 

In the other direction, let G be a graph which is not 
(r, £)-decodable. Hence, after r rounds of decoding, there 
exists a vertex v € Vl which was not corrected. We now 
show that v is the root of an (r, t) -undecodable configuration 
H = (V[,V R , E') which is an ordered bi-partite subgraph of 
G. We start by obviously defining Nq(v) = {v} and taking 
V[ = {?;}, V' R = 0, and E' = 0. Now, since v was not 
decoded at the end of round r, round r — 1 ended with v 
having at least t + 1 undecoded neighbors. We denote this set 
of neighbors as Ni(v). We also add these neighbors to V R and 
the appropriate edges to E'. Take some v' G Ni(v). Since v' 
was not decoded at the end of round r — 1, round r — 2 ended 
with v' having at least t + 1 undecoded neighbors. Going over 
all possible v' G V R , and taking the union of the undecoded 
neighbors we get N 2 (v). Note that N (v) C N 2 (v). We add 
N 2 (v) to V' L and the appropriate edges to E'. Continuing in 
the same manner we get an (r, t) -undecodable configuration 
as the theorem states. ■ 

By the previous theorem, the question of undecodability 
becomes a purely graph-theoretic question. For the asymptotic 
analysis we need the following definitions and probabilistic 
tools. 

Theorem 2 (14]): Let X be a non-negative integral-valued 
random variable. If E[X] = o(l), then X = almost always. 
Proof: Trivial. ■ 

For the rest of this section, let X = X\ + ■ ■ ■ + X m where 
Xi is the indicator random variable for event Ai. For indices 
i, j, we write i ~ j if i ^ j and events Af and Aj are not 
independent. We define 

A = J^Pv[A i AA j }. 



Theorem 3 (Corollary 4.3.4, [4]): If E[X] — > oo and A = 
o(E[X] 2 ), then X > almost always. 

Definition 5 (14]): Let H be a graph with t> vertices and e 
edges. We call p = e/v the density of If. We call H balanced 
if every subgraph H' has p(H') < p(H). We call if strictly 
balanced if every proper subgraph if' has p(H') < p{H). 

The following is an adaptation of Theorem 4.4.2, [4], to 
bi-partite random graphs. 

Theorem 4: Let H be a balanced bi-partite graph with v 
vertices and e edges. Let G(n,n,p) be a random bi-partite 
graph, and let A be the event that if is an ordered bi-partite 
subgraph of G. Then p — n~ v l e is the threshold function for 
A. 

Proof: Let H = (V^,V R ,E') be a balanced bi-partite 
graph. Denote vl = \V L \, and vr = \V R \, so v = vl + Vr. 
Let G = (Vl, Vr,E) be a random bi-partite graph. Let S be 
a w-subset of the vertices of G such that \S HVl\ = vl and 
5* H Vh| = U -R- Let As be the event that the subgraph of G 
induced by S contains H as an ordered bi-partite subgraph. 
Then obviously, 

P e < Pr[As] < v L \v R l P e . 
Let Xs be the indicator random variable for As and 

X = J2Xs- 

s 

By linearity of expectation, 

E[X] =Y^E[X S ] = Pr[A s ] = 6(n> e ). 

If p(n) < then E[X] = o(l), so by Theorem|2] X = 

almost always. 

Now assume p(n) 3> n~ v l e so that f?[X] — > oo, and 
consider A of Theorem [3] 

A = £ Pr[A s AA t }=J2 PriM J2 P *1 A t\M- 

S^T S T^S 

Here, two w-sets S and T satisfy S ~ T if and only if S ^ 
T and they share some edges, i.e., |5nrnV£,| > 1 and 
|5 n T n Vh| > 1. Let S be fixed, so 

v-l 

Pr[A T \A s ] =J2 E P Mt\M- 

T~S i=2 \SnT\—i 

\SnTnv L \>i 
\snmv R \>i 

For each i there are 0(n v ~ l ) choices of T. Fix S and T, and 
consider Pr[Ar|^4s]- There are 0(1) possible copies of H on 
T. Since ff is balanced, each has at most — edges with both 
vertices in S, hence at least e — — other edges. Therefore, 

Pr[^ T |A s ] = 0(p e "^), 



and 

u-l 

^Pr[A T | J 4 s ] = ^ 0(n-y-*) 

u — 1 w — 1 

= ^ 0((nV) 1_ « ) = E °( ri > C ) = 

i=2 i=2 

since p(n) ^> n~ v l e . We have already seen that Pr[Ag] = 
0(p e ), and there are 0(n v ) choices for S, so 

A = J2?x[As}J2Pr[A T \A s } 

S T~S 

= 0(n v p e )o{E[X}) = o(E[X} 2 ). 

By Theorem [3] X > almost always. ■ 
Let us start by examining one specific type of an 
(r, £)-undecodable configuration. We define an exact (r, t)- 
undecodable tree as an (r, £)-undecodable configuration with- 
out cycles in which each vertex at distance at most r — 1 from 
the root has degree t + 1 exactly. It is easy to see that such a 
graph is strictly balanced. We denote the number of edges in 
such a tree as eT(r,i). This number is easily seen to be: 

, , [2r t=l 
e T (r,t) = i , _ r 1 (1) 

\(* + 1 )ifr 

Since this is a tree, obviously the number of vertices Vt(t, t) is 
exactly ey(r, f) + 1. By Theorem^ the threshold function for 
the existence of an exact (r, <)-undecodable tree in G(n,n,p) 
is 

p = n -( 1 +i?fety). (2) 

Another case is when the (r, i)-undecodable tree is not 
exact, i.e., the configuration is a tree, but some vertices at 
distance at most r — 1 from the root have a degree which is 
strictly more than t+1. However, in such a case, the existence 
of a non-exact (r, f)-undecodable tree implies the existence of 
an exact (r, i)-undecodable tree (simply trim the excess edges 
and vertices). 

Thus we are left with the case of (r, i)-undecodable config- 
urations which are not trees at all. Such configurations must 
contain cycles. It is also easy to see that such configurations 
must contain a simple cycle with at most 2r edges. 

If we take a graph of a simple cycle with e edges, it also 
has e vertices. This graph is also strictly balanced. It follows 
that the threshold function for the existence of such a cycle is 
p = nT 1 . Hence, when (0 holds, or when 

p < 7r( 1+ ^5^) , 

there are almost always no simple cycles of length at most 2r. 
This is because each length almost always does not appear, 
and there are 0(1) such lengths which interest us, so a simple 
union bound suffices. Thus, there are almost always no (r, t)- 
undecodable configurations with cycles in G(n,n,p) under 
these conditions. 



Corollary 1: The threshold function for the existence of an 
(r, i)-undecodable configuration in G(n,n,p), for some fixed 
r and t, is 

p = „-( 1+ 5?fcu), 

where ex(r, t) is given by Q. 

Now that we have established a threshold behavior for the 
existence of (r, i)-undecodable configurations, we are left with 
the case where 

-(i+ir-iVrf) 
p = c ■ n V Ti'i'i; j 

for some constant c > 0. We know that in this case, the ques- 
tion of the existence of an (r, t)-undecodable configuration in 
G(n, n,p) reduces to the question of the existence of an exact 
(r, t)-tree in G(n,n,p). 

For the following analysis we need the Janson inequality 
[5]. Let il be a finite universal set, and let R be a random 
subset of £1 given by 

Pr[r e R] = p r , 

where these events are mutually independent. Let Bi, i £ I 
be subsets of SI, where I is a finite index set. Let Aj be the 
event that Bi C R. Let Xi be the indicator variable for Aj, 
and let X = Yliei-^i- We denote the complementary event 
to Ai as A7. We set 



M 



n pr K 



Theorem 5 (The Janson Inequality, [5]): Let Aj, i € 7, A, 
and M, be as above, and assume that Pr[Aj] < e for all i £ I. 
Then 

M < Pr[A ie/ A~] < Me 1 ^. 

We can now continue by adapting Theorem 10.1.1, [4], to 
bi-partite graphs. 

Theorem 6: Let H be a strictly-balanced bi-partite graph 
with v vertices, e edges, and a automorphisms. Let c > be 
some constant. We denote by A the event that G does not have 
H as an ordered bi-partite subgraph. Then, when p = c-n~ v ^ e 
we have, 

lim Pr[G(n, n, p) \= A] = exp[—c e /a]. 

n — >oo 

Proof: Let H = (V[, V' R , E') be a strictly-balanced bi- 
partite graph. Denote vl = \V[\, and vr — \V R \, so v = 
v L + v R . Let B h l< i< (™ L )(™ R )v L \v R \/a, range over the 
edge sets of possible placements of H as an ordered bi-partite 
subgraph of G. Let Aj be the event that Bi C G(n,n,p). 

We use Janson's inequality from Theorem |5] For all i, 
Pr[A] = P e , so 

lim M = lim (1 -p e )(""J(4)^ !, ' R! / a = exp[-c e /a], 

since p = c • n~ v ^ e . We turn to handle 

A = ^Pr[AA^]. 

When i ~ j, let fc denote the number of vertices in the 
intersection of the two placements of H. Obviously 2 < k < v. 



Let fk denote the maximal number of edges in the intersection 
when i ~ j and there are fc vertices in the intersection. 
For k = v we obviously have f v < e since i ^ j. When 
2 < k < v — 1, since iJ is strictly-balanced and n is a 
subgraph of ff , 

l± < £ 

A; w 

There are 0(n 2v ~ k ) choices of i and j which intersect in k 
vertices. Hence, for each such i and j, 

Pr[A A Aj] = pl SiUS jl = p2e-|B in fl,| < p 2e-/ fc; 

and then 

A = J20{n 2v ^ k )0{n-^^ 2e - f -y). 



But 



fc=2 



2i;-A:--(2e-/ fe ) = ^-fc<0, 
e e 



so A = o(l). Janson's inequalities become a sandwich, so 

lim Pr[AiA 4 ] = lim M — exp[—c e /a]. 

n — >oo n — ^oo 

■ 

Fortunately, an exact (r, i)-tree is strictly balanced, and has 
the following number of automorphisms: 

aT(MH {(t + l)!(t!)^)^ T>1. (3) 

Thus we get the following corollary: 

Corollary 2: The probability that G(n,n,p) is (r, t)- 

f i-l i ) 

decodable when p = c ■ n \ "rM)^ is asymptotically 
exp[— c eT ( r ' t * > /a,T(r, t)], where er(r,t) is given by and 
ar(r,t) is given by (0. 

Finally, we summarize the case of constant error correction 
in Table E 

TABLE I 

The asymptotic probability of decoder failure with r rounds 

OF DECODING, A CONSTANT t OF DECODABLE ERRORS IN EACH ROW AND 
COLUMN, AND ERASURE PROBABILITY p 



p(n) 


Decodability 


fl+ 1 \ 


Almost always decodable 


(l+ 1 ) 
p = c-n < e r( r ''» 


Decodable with probability 

exp\-c eT( - r ' t '> /a T (r, i)} 


/i+ 1 ) 


Almost always undecodable 



IV. The Linear Error Correction Case 

We now turn to the case of linear error correction capabil- 
ities. This case appears to be much simpler than the previous 
one. Let F = {Ci, C2, . . . } be an infinite family of codes of 
ever increasing length, and let us denote the length of Ci by 
rii. We now require that Ci is capable of correcting arii errors, 



where < a < 1 is some constant. We note that "good" codes 
also fall into this category. 

Just like before, in this section we consider a product code 
having codewords of size n x n. For convenience, the row 
code and the column code are each capable of correcting an 
errors, where < a < 1 is a constant. We denote by p the 
erasure probability of the channel. 

The main tool for our analysis is the well known Chernoff 
bound. 

Theorem 7 (The Chernoff Bound): Let X — Y]f—i Xj be 
the sum of n independent random variables, each in [0, 8]. Let 
e £ (0, 1], and /i > 0, be fixed. Then, 

. If E[Xi] < n for all i, then Pr[X > (1 + e)/xn] < 

exp[— e 2 /in/ (3d)]. 
. If E[Xi] > fi for all i, then Pr[X < (1 - e)fin] < 
exp[—e 2 fj,n/(2S)]. 
We can now state the main result. 

Theorem 8: Let < p < 1 be fixed. Then, 

• If p < a then using only one round of decoding, i.e., 
only the row code, the decoder successfully decodes any 
received word almost always. 

• If p > a then no matter how many rounds of decoding 
are used, the decoder fails to decode any received word 
almost always. 

Proof: Assume p < a. Choose some fixed < e < 
a/p— 1. Using the Chernoff bound of Theorem any given 
row contains less than an errors with probability tending to 
1 exponentially fast. Hence, by a simple union bound, the 
probability of the event that there is some undecodable row 
in the first round, tends to exponentially fast. Hence, the 
first round of decoding successfully corrects all errors almost 
always. 

If p > a, choose some fixed < e < 1 — a/p. Just 
like the previous case, the probability that any given row or 
column contain less than an errors tends to exponentially 
fast by the Chernoff bound. Hence, given a received word, by 
a simple union bound, the probability that there is any row or 
column which is decodable tends to zero exponentially fast. 
This means, that almost always the decoder fails on all the 
rows and all the columns. Hence, no matter how many rounds 
are used, the decoding process fails almost always. ■ 

V. Conclusion 

We analyzed the asymptotic probability of decoding failure 
of iterative decoders for product codes. Our analysis is limited 
to the case of hard-decision decoding over the erasure channel. 
We examined both the case of codes capable of correcting a 
constant number of errors in each row and column, and the 
case of codes capable of correcting a constant fraction of the 
length of each row and column. 

In the case of constant error correction, the asymptotic 
probability shows a threshold behavior. As shown in Table U 
when the erasure probability decays faster than the threshold 
function, we can correctly decode every received word almost 
always. Conversely, when it decays slower, we almost always 
fail to decode any received word. When the erasure probability 



decays just like the threshold function up to a multiplication 
by a constant, we have an exact expression for the probability 
of decoder failure. It should be noted, that it is beneficial to 
invest in more rounds of decoding, and codes which correct 
more errors, since both ex(r,t) and ax(r, t) are increasing 
functions and ar(r, t) grows faster than er{r, t). This means 
that higher values of r and t give threshold functions closer 
to n^ 1 , and higher probability of successfully decoding when 
the erasure probability is at the threshold. 

The other case, of linear error correction capabilities, is 
perhaps more curious. Again we have a sharp threshold 
behavior, and in this case, it is constant. However, above this 
threshold, we almost always have too many errors for each 
row or column to correct and we fail, no matter how many 
rounds of decoding we do. Below this threshold, we almost 
always eliminate all the errors after the first round of decoding, 
so one round of decoding is enough. In that case, we do not 
get to use the column code at all, and the redundancy invested 
in it - is simply redundant. So it appears in that case, that it 
is better to use just the row code instead of the product code. 
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